Data-driven hypothesis weighting increases detection power in genome-scale multiple testing
نویسندگان
چکیده
منابع مشابه
Data-driven hypothesis weighting increases detection power in multiple testing
Hypothesis weighting is a powerful approach for improving the power of data analyses that employ multiple testing. However, in general it is not evident how to choose the weights in a data-dependent manner. We describe independent hypothesis weighting (IHW), a method for making use of informative covariates that are independent of the test statistic under the null, but informative of each test’...
متن کاملData-driven hypothesis weighting increases detection power in big data analytics
Hypothesis weighting is a powerful approach for improving the power of data analyses that employ multiple testing. However, in general it is not evident how to choose the weights. We describe IHW, a method for data-driven hypothesis weighting that makes use of informative covariates that are independent of the test statistic under the null, but informative of each test’s power or prior probabil...
متن کاملCombining Multiple Hypothesis Testing with Machine Learning Increases the Statistical Power of Genome-wide Association Studies
The standard approach to the analysis of genome-wide association studies (GWAS) is based on testing each position in the genome individually for statistical significance of its association with the phenotype under investigation. To improve the analysis of GWAS, we propose a combination of machine learning and statistical testing that takes correlation structures within the set of SNPs under inv...
متن کاملMultiple Hypothesis Testing in Data Mining
Aalto University, P.O. Box 11000, FI-00076 Aalto www.aalto.fi Author Sami Hanhijärvi Name of the doctoral dissertation Multiple Hypothesis Testing in Data Mining Publisher School of Science Unit Department of Information and Computer Science Series Aalto University publication series DOCTORAL DISSERTATIONS 51/2012 Field of research Computer and Information Science Manuscript submitted 15 June 2...
متن کاملMultiple Hypothesis Testing for Data Mining
INTRODUCTION A number of important problems in data mining can be usefully addressed within the framework of statistical hypothesis testing. However, while the conventional treatment of statistical significance deals with error probabilities at the level of a single variable, practical data mining tasks tend to involve thousands, if not millions , of variables. This Chapter looks at some of the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Nature Methods
سال: 2016
ISSN: 1548-7091,1548-7105
DOI: 10.1038/nmeth.3885